Search results for: GRU-LSTM method

Number of results: 377112

2016
Kazuki Irie, Zoltán Tüske, Tamer Alkhouli, Ralf Schlüter, Hermann Ney

Popularized by the long short-term memory (LSTM), multiplicative gates have become a standard means to design artificial neural networks with intentionally organized information flow. Notable examples of such architectures include gated recurrent units (GRU) and highway networks. In this work, we first focus on the evaluation of each of the classical gated architectures for language modeling fo...
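
As an illustration of the multiplicative gating this abstract refers to, below is a minimal sketch of a highway layer, one of the gated architectures it names (the weight names W_h, W_t and so on are illustrative, not taken from the paper).

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def highway_layer(x, W_h, b_h, W_t, b_t):
        # A multiplicative transform gate T(x) decides how much of the
        # transformed signal H(x) versus the untouched input x flows through.
        h = np.tanh(W_h @ x + b_h)      # candidate transformation H(x)
        t = sigmoid(W_t @ x + b_t)      # transform gate T(x), values in (0, 1)
        return t * h + (1.0 - t) * x    # gated mix of transform and carry paths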

2017
Mohamed Morchid

Long Short-Term Memory (LSTM) Recurrent Neural Networks (RNNs) require 4 gates to learn short- and long-term dependencies for a given sequence of basic elements. Recently, the “Gated Recurrent Unit” (GRU) has been introduced; it requires fewer gates than the LSTM (only reset and update gates) to encode short- and long-term dependencies, and it reaches performance equivalent to the LSTM with less processing time during ...
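
For reference on the gate counts contrasted above, here is a minimal sketch of one GRU step in its standard formulation, showing only the reset and update gates (the parameter dictionary p and its weight names are illustrative).

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def gru_step(x, h_prev, p):
        # Standard GRU update with two gates: update gate z and reset gate r.
        z = sigmoid(p["Wz"] @ x + p["Uz"] @ h_prev)              # update gate
        r = sigmoid(p["Wr"] @ x + p["Ur"] @ h_prev)              # reset gate
        h_cand = np.tanh(p["Wh"] @ x + p["Uh"] @ (r * h_prev))   # candidate state
        return (1.0 - z) * h_prev + z * h_cand                   # blend old and new state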

2016
Wenjie Luo

The RNN is a powerful model for sequence data but suffers from vanishing and exploding gradients, which makes it difficult to train to capture long-range dependencies. LSTM and GRU have been proposed to address this; they try to model the differences between adjacent data frames rather than the data frames themselves. By doing so, they allow the error to back-propagate through longer time spans without vanishing. Also, instead of...
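
To make the back-propagation argument concrete, below is a minimal sketch of one LSTM step: the cell state is updated additively, which is what lets the error signal travel across long time spans without vanishing (weight names are illustrative).

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def lstm_step(x, h_prev, c_prev, p):
        # Standard LSTM update; the additive change to the cell state c is the
        # path along which gradients can flow over many time steps.
        i = sigmoid(p["Wi"] @ x + p["Ui"] @ h_prev)   # input gate
        f = sigmoid(p["Wf"] @ x + p["Uf"] @ h_prev)   # forget gate
        o = sigmoid(p["Wo"] @ x + p["Uo"] @ h_prev)   # output gate
        g = np.tanh(p["Wg"] @ x + p["Ug"] @ h_prev)   # candidate cell update
        c = f * c_prev + i * g                        # additive cell-state update
        h = o * np.tanh(c)                            # gated hidden output
        return h, c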

2015
Rafal Józefowicz, Wojciech Zaremba, Ilya Sutskever

The Recurrent Neural Network (RNN) is an extremely powerful sequence model that is often difficult to train. The Long Short-Term Memory (LSTM) is a specific RNN architecture whose design makes it much easier to train. While wildly successful in practice, the LSTM’s architecture appears to be ad-hoc so it is not clear if it is optimal, and the significance of its individual components is unclear...

Journal: CoRR, 2016
Guo-Bing Zhou, Jianxin Wu, Chen-Lin Zhang, Zhi-Hua Zhou

Recently, recurrent neural networks (RNNs) have been very successful in handling sequence data. However, understanding RNNs and finding the best practices for them is a difficult task, partly because there are many competing and complex hidden units (such as the LSTM and the GRU). We propose a gated unit for RNNs, named the Minimal Gated Unit (MGU), since it contains only one gate, which is a minimal design a...
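
Following the abstract's description of a unit with a single gate, here is a hedged sketch of how such a minimal gated step can be written, with the one gate playing both the reset and the update role (this follows the general MGU idea only; weight names are illustrative).

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def mgu_step(x, h_prev, p):
        # One gate f both rescales the previous state inside the candidate
        # computation and controls how much of the candidate replaces it.
        f = sigmoid(p["Wf"] @ x + p["Uf"] @ h_prev)               # the single gate
        h_cand = np.tanh(p["Wh"] @ x + p["Uh"] @ (f * h_prev))    # gated candidate state
        return (1.0 - f) * h_prev + f * h_cand                    # blend old and new state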

Journal: CoRR, 2017
Brendan Maginnis, Pierre H. Richemond

Recurrent Neural Network architectures excel at processing sequences by modelling dependencies over different timescales. The recently introduced Recurrent Weighted Average (RWA) unit captures long-term dependencies far better than an LSTM on several challenging tasks. The RWA achieves this by applying attention to each input and computing a weighted average over the full history of its comput...
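
The running weighted average the abstract describes can be maintained with two accumulators, as in the sketch below (this illustrates the idea only, not the paper's exact RWA equations; all names are illustrative).

    import numpy as np

    def rwa_like_step(z, a, num_prev, den_prev):
        # z is the feature for the current input, a its attention score;
        # the average runs over the entire history seen so far.
        w = np.exp(a)              # unnormalised attention weight for this step
        num = num_prev + w * z     # running weighted sum of features
        den = den_prev + w         # running sum of weights
        h = np.tanh(num / den)     # normalised average squashed into a hidden state
        return h, num, den

Starting from num_prev = 0 and a small positive den_prev and calling this once per time step yields a hidden state that averages over the whole input history.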

Journal: CoRR, 2014
Junyoung Chung, Çaglar Gülçehre, Kyunghyun Cho, Yoshua Bengio

In this paper we compare different types of recurrent units in recurrent neural networks (RNNs). In particular, we focus on the more sophisticated units that implement a gating mechanism, such as the long short-term memory (LSTM) unit and the recently proposed gated recurrent unit (GRU). We evaluate these recurrent units on the tasks of polyphonic music modeling and speech signal modeling. Our experiments...

Journal: CoRR, 2016
Dingkun Long, Richong Zhang, Yongyi Mao

The difficulty in analyzing LSTM-like recurrent neural networks lies in the complex structure of the recurrent unit, which induces highly complex nonlinear dynamics. In this paper, we design a new simple recurrent unit, which we call Prototypical Recurrent Unit (PRU). We verify experimentally that PRU performs comparably to LSTM and GRU. This potentially enables PRU to be a prototypical example...

Journal: CoRR, 2018
Md. Zahangir Alom, Adam T. Moody, Naoya Maruyama, Brian C. Van Essen, Tarek M. Taha

Deep learning approaches, Recurrent Neural Networks (RNNs) in particular, have shown superior accuracy in a large variety of tasks including machine translation, language understanding, and movie frame generation. However, these deep learning approaches are very expensive in terms of computation. In most cases, Graphics Processing Units (GPUs) are used for large-scale implementations. Meanwhile, energy e...

Chart of the number of search results per year
